End-to-End Data Solutions for Distributed Petascale Science
نویسندگان
چکیده
Jennifer M. Schopf, Ann Chervenak, Ian Foster, Dan Fraser, Dan Gunter, Nick LeRoy, Brian Tierney 1 Computation Institute, University of Chicago and Argonne National Laboratory 2 Mathematics and Computer Science Division, Argonne National Laboratory 3 Information Sciences Institute, University of Southern California 4 Department of Computer Science, University of Chicago 5 Lawrence Berkeley National Laboratory 6 Department of Computer Science, University of Wisconsin
منابع مشابه
Zest: The Maximum Reliable TBytes/sec/$ for Petascale Systems
3 Abstract PSC has developed a prototype distributed file system infrastructure that vastly accelerates aggregated write bandwidth on large compute platforms. Write bandwidth, more than read bandwidth, is the dominant bottleneck in HPC I/O scenarios due to writing checkpoint data, visualization data and post-processing (multi-stage) data. We have prototyped a scalable solution on the Cray XT3 c...
متن کاملThe CEDPS Troubleshooting Architecture and Deployment on the Open Science Grid
Tracking failures and poor performance across a widely distributed system of resources has proven challenging for many ongoing DOE applications. An example is the Open Science Grid (OSG) project, which currently experiences a roughly 15% job failure rate. This can be an issue not only for Grid computing but for anyone performing large-scale data transfers to remote machines because of the large...
متن کاملReal-time data access monitoring in distributed, multi- petabyte systems
Petascale systems are in existence today and will become common in the next few years. Such systems are inevitably very complex, highly distributed and heterogeneous. Monitoring a petascale system in real-time and understanding its status at any given moment without impacting its performance is a highly intricate task. Common approaches and off-theshelf tools are either unusable, do not scale, ...
متن کاملStork data scheduler: mitigating the data bottleneck in e-Science.
In this paper, we present the Stork data scheduler as a solution for mitigating the data bottleneck in e-Science and data-intensive scientific discovery. Stork focuses on planning, scheduling, monitoring and management of data placement tasks and application-level end-to-end optimization of networked inputs/outputs for petascale distributed e-Science applications. Unlike existing approaches, St...
متن کاملPetascale Research in Earthquake System Science on Blue Waters (PressOn)
Broader Impacts. The Southern California Earthquake Center (SCEC) conducts a broad program of earthquake system science that seeks to develop a predictive understanding of earthquake processes with a practical mission aimed at providing society with improved understanding of seismic hazards. In partnership with earthquake engineers, SCEC researchers are developing the ability to conduct end-to-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007